DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 21

1
Working with a small dataset - semi-supervised dependency parsing for Irish
In: Lynn, Teresa, Foster, Jennifer orcid:0000-0002-7789-4853 , Dras, Mark orcid:0000-0001-9908-7182 and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) Working with a small dataset - semi-supervised dependency parsing for Irish. In: Fourth Workshop on Statistical Parsing of Morphologically Rich Languages, 18 Oct 2013, Seattle, WA. USA. (2013)
BASE
Show details
2
Working with a small dataset - semi-supervised dependency parsing for Irish
Lynn, Teresa; Foster, Jennifer; Dras, Mark. - : Stroudsburg, PA : Association for Computational Linguistics, 2013
BASE
Show details
3
Detecting grammatical errors with treebank-induced, probabilistic parsers
Wagner, Joachim. - : Dublin City University. School of Computing, 2012
In: Wagner, Joachim orcid:0000-0002-8290-3849 (2012) Detecting grammatical errors with treebank-induced, probabilistic parsers. PhD thesis, Dublin City University. (2012)
Abstract: Today's grammar checkers often use hand-crafted rule systems that define acceptable language. The development of such rule systems is labour-intensive and has to be repeated for each language. At the same time, grammars automatically induced from syntactically annotated corpora (treebanks) are successfully employed in other applications, for example text understanding and machine translation. At first glance, treebank-induced grammars seem to be unsuitable for grammar checking as they massively over-generate and fail to reject ungrammatical input due to their high robustness. We present three new methods for judging the grammaticality of a sentence with probabilistic, treebank-induced grammars, demonstrating that such grammars can be successfully applied to automatically judge the grammaticality of an input string. Our best-performing method exploits the differences between parse results for grammars trained on grammatical and ungrammatical treebanks. The second approach builds an estimator of the probability of the most likely parse using grammatical training data that has previously been parsed and annotated with parse probabilities. If the estimated probability of an input sentence (whose grammaticality is to be judged by the system) is higher by a certain amount than the actual parse probability, the sentence is flagged as ungrammatical. The third approach extracts discriminative parse tree fragments in the form of CFG rules from parsed grammatical and ungrammatical corpora and trains a binary classifier to distinguish grammatical from ungrammatical sentences. The three approaches are evaluated on a large test set of grammatical and ungrammatical sentences. The ungrammatical test set is generated automatically by inserting common grammatical errors into the British National Corpus. The results are compared to two traditional approaches, one that uses a hand-crafted, discriminative grammar, the XLE ParGram English LFG, and one based on part-of-speech n-grams. In addition, the baseline methods and the new methods are combined in a machine learning-based framework, yielding further improvements.
Keyword: Artificial intelligence; Computational linguistics; decision tree learning; error corpora; error detection; grammar checker; Language; learner corpus; Linguistics; Machine learning; n-gram language models; natural language processing; precision grammar; probabilistic grammar; ROC curve; voting classifier
URL: http://doras.dcu.ie/16776/
BASE
Hide details
4
Identifying high-impact sub-structures for convolution kernels in document-level sentiment classification
In: Tu, Zhaopeng, He, Yifan, Foster, Jennifer orcid:0000-0002-7789-4853 , van Genabith, Josef orcid:0000-0003-1322-7944 , Liu, Qun and Shouxun, Lin (2012) Identifying high-impact sub-structures for convolution kernels in document-level sentiment classification. In: Annual Meeting of the Association for Computational Linguistics (ACL 2012), 9-11 Jul 2012, Jelu, Korea. (2012)
BASE
Show details
5
Irish treebanking and parsing: a preliminary evaluation
In: Lynn, Teresa, Cetinoglu, Ozlem, Foster, Jennifer orcid:0000-0002-7789-4853 , Uí Dhonnchadha, Elaine orcid:0000-0003-3448-4288 , Dras, Mark orcid:0000-0001-9908-7182 and van Genabith, Josef orcid:0000-0003-1322-7944 (2012) Irish treebanking and parsing: a preliminary evaluation. In: International Conference on Linguistic Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey. (2012)
BASE
Show details
6
Irish treebanking and parsing : a preliminary evaluation
Lynn, Teresa; Çetinoğlu, Özlem; Foster, Jennifer. - : European Language Resources Association (ELRA), 2012
BASE
Show details
7
Decreasing lexical data sparsity in statistical syntactic parsing - experiments with named entities
In: Hogan, Deirdre, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Decreasing lexical data sparsity in statistical syntactic parsing - experiments with named entities. In: Multiword Expressions: from Parsing and Generation to the Real World (MWE). Workshop at ACL 2011, 19-24 June 2011, Portland, Oregon. (2011)
BASE
Show details
8
Comparing the use of edited and unedited text in parser self-training
In: Foster, Jennifer orcid:0000-0002-7789-4853 , Cetinoglu, Ozlem, Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Comparing the use of edited and unedited text in parser self-training. In: The 12th International Conference on Parsing Technologies (IWPT 2011), 05-07 Oct 2011, Dublin, Ireland. ISBN 978-1-932432-04-6 (2011)
BASE
Show details
9
From news to comment: Resources and benchmarks for parsing the language of web 2.0
In: Foster, Jennifer orcid:0000-0002-7789-4853 , Cetinoglu, Ozlem, Wagner, Joachim orcid:0000-0002-8290-3849 , Le Roux, Joseph, Nivre, Joakim, Hogan, Deirdre and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) From news to comment: Resources and benchmarks for parsing the language of web 2.0. In: The 5th International Joint Conference on Natural Language Processing (IJCNLP), 08-13 Nov 2011, Chiang Mai, Thailand. ISBN 978-974-466-564-5 (2011)
BASE
Show details
10
#hardtoparse: POS tagging and parsing the twitterverse
In: Foster, Jennifer orcid:0000-0002-7789-4853 , Cetinoglu, Ozlem, Wagner, Joachim orcid:0000-0002-8290-3849 , Le Roux, Joseph, Hogan, Stephen, Nivre, Joakim, Hogan, Deirdre and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) #hardtoparse: POS tagging and parsing the twitterverse. In: The AAAI-11 Workshop on Analyzing Microtext, 8 Aug 2011, San Francisco, CA. (2011)
BASE
Show details
11
Improving dependency label accuracy using statistical post-editing: A cross-framework study
In: Cetinoglu, Ozlem, Bryl, Anton, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Improving dependency label accuracy using statistical post-editing: A cross-framework study. In: International Conference on Dependency Linguistics (DepLing), 5-7 Sept 2011, Barcelona, Spain. (2011)
BASE
Show details
12
LFG without C-structures
In: Cetinoglu, Ozlem, Foster, Jennifer orcid:0000-0002-7789-4853 , Nivre, Joakim, Hogan, Deirdre, Cahill, Aoife orcid:0000-0002-3519-7726 and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) LFG without C-structures. In: the 9th International Workshop on Treebanks and Linguistic Theories, 3 - 4 Dec. 2010, Tartu Estonia. (2010)
BASE
Show details
13
Handling unknown words in statistical latent-variable parsing models for Arabic, English and French
In: Attia, Mohammed, Foster, Jennifer orcid:0000-0002-7789-4853 , Hogan, Deirdre, Le Roux, Joseph, Tounsi, Lamia and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Handling unknown words in statistical latent-variable parsing models for Arabic, English and French. In: SPMRL 2010 - 1st Workshop on Statistical Parsing of Morphologically-Rich Languages at NAACL HLT 2010, 5 June 2010, Los Angeles, CA, USA. (2010)
BASE
Show details
14
Handling Unknown Words in Statistical Latent-Variable Parsing Models for Arabic, English and French
In: Proceedings of the First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010) ; First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010) ; https://hal.archives-ouvertes.fr/hal-00702414 ; First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010), 2010, United States. pp.67-75 (2010)
BASE
Show details
15
Judging grammaticality: experiments in sentence classification
In: Wagner, Joachim orcid:0000-0002-8290-3849 , Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Judging grammaticality: experiments in sentence classification. CALICO Journal, 26 (3). pp. 474-490. ISSN 0742-7778 (2009)
BASE
Show details
16
Adapting a WSJ-trained parser to grammatically noisy text
In: Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef (2008) Adapting a WSJ-trained parser to grammatically noisy text. In: ACL-08:HLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 15-20 June 2008, Columbus, USA. (2008)
BASE
Show details
17
Parser evaluation and the BNC: evaluating 4 constituency parsers with 3 metrics
In: Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef (2008) Parser evaluation and the BNC: evaluating 4 constituency parsers with 3 metrics. In: LREC 2008 - Sixth International Conference on Language Resources and Evaluation, 28-30 May 2008, Marrakech, Morocco. (2008)
BASE
Show details
18
Parser-based retraining for domain adaptation of probabilistic generators
In: Hogan, Deirdre, Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef (2008) Parser-based retraining for domain adaptation of probabilistic generators. In: INLG 08 - 5th International Natural Language Generation Conference, 12-14 June 2008, Salt Fork, Ohio, USA. (2008)
BASE
Show details
19
C-structures and f-structures for the British national corpus
In: Wagner, Joachim orcid:0000-0002-8290-3849 , Seddah, Djamé, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef (2007) C-structures and f-structures for the British national corpus. In: Lexical Functional Grammar 2007, 28-30 July 2007, California, USA. (2007)
BASE
Show details
20
Adapting WSJ-trained parsers to the British national corpus using in-domain self-training
In: Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 , Seddah, Djamé and van Genabith, Josef (2007) Adapting WSJ-trained parsers to the British national corpus using in-domain self-training. In: IWPT 2007 - 10th International Conference of Parsing Technology, 23-24 June 2007, Prague, Czech Republic. (2007)
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
21
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern